Tagging accurately - Don't guess if you know

نویسندگان

  • Pasi Tapanainen
  • Atro Voutilainen
چکیده

We discuss combining knowledge-based (or rule-based) and statistical part-of-speech taggers. We use two mature taggers, ENGCG and Xerox Tagger, to independently tag the same text and combine the results to produce a fully disambiguated text. In a 27000 word test sample taken from a previously unseen corpus we achieve 98.5 % accuracy. This paper presents the data in detail. We describe the problems we encountered in the course of combining the two taggers and discuss the problem of evaluating taggers.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Optimization Problem with a Surprisingly Simple Solution

Suppose you and n of your friends play the following game. A random number from the uniform distribution on [0, 1] will be generated. This number is called the target. Each of you will independently guess what the target number will be and the person whose guess is closest will be declared the winner. In order to investigate an optimal strategy for this game, we need to assume something about y...

متن کامل

Microsimulation of complex system dynamics: automata models in biology and finance

" For the things we have to learn before we can do them, we learn by doing them. " Aristotle (384-322 BC), Nichomachean Ethics " First you guess. Don't laugh, this is the most important step. Then you compute the consequences. Compare the consequences with experience. If it disagrees with experience, the guess is wrong. In this simple statement is the key to science. " Richard P. Feynmann " If ...

متن کامل

And Co - Leader of the First Large Scale Genomic Study of Healthy Aging

You compare the genomes of the 'Wellderly' (people over 80 who don't have any major age-related disease) to the genomes of the ITMI cohort. But this ITMI cohort seems like an odd choice. First, they aren't representative of the general population: they have lower BMIs and higher degrees of educational attainment. Second, they are an average of 33 years old, so you don't know if they are 'normal...

متن کامل

Scleroderma lung disease: "if you don't know where you are going, any road will take you there".

" If you don't know where you are going, any road will take you there. " It is one of my favorite sayings. It always reminds me that if you and I don't have a plan of some kind, then we will wander aimlessly toward whatever. And generally, if we have not thought out where we are going, or where we would like to go, then other people, or life circumstances, will create the way for us. I heard th...

متن کامل

Interviewing witnesses: the effect of forced confabulation on event memory.

After viewing a crime video, participants answered 16 answerable and 6 unanswerable questions. Those in the "voluntary guess" condition had a "don't know" response option; those in the "forced guess" condition did not. One week later the same questions were answered with a "don't know" option. In both experiments, information generated from forced confabulation was less likely remembered than i...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1994